CDS

Accession Number TCMCG073C37135
gbkey CDS
Protein Id XP_010523196.1
Location join(5042920..5043003,5043084..5043418,5043515..5043872,5043972..5044559,5044639..5044914)
Gene LOC104801586
GeneID 104801586
Organism Tarenaya hassleriana

Protein

Length 546aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268022
db_source XM_010524894.1
Definition PREDICTED: putative galacturonosyltransferase 2 [Tarenaya hassleriana]
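
The Length value can be cross-checked against the Location field of the CDS block. GenBank-style coordinates are 1-based and inclusive, so the summed length of the five joined intervals, divided by three, should equal the number of codons including the stop codon. The following is a minimal Python sketch of that check. The helper name cds_length is illustrative and not part of any database tooling, and it assumes a simple join(...) of start..end spans with no complement or partial-coordinate markers.

```python
import re

def cds_length(location: str) -> int:
    """Sum the lengths of all start..end spans in a GenBank-style location string."""
    spans = re.findall(r"(\d+)\.\.(\d+)", location)
    # Coordinates are inclusive, hence the +1 per span.
    return sum(int(end) - int(start) + 1 for start, end in spans)

# Location string copied from the CDS block of this record.
location = ("join(5042920..5043003,5043084..5043418,5043515..5043872,"
            "5043972..5044559,5044639..5044914)")

nucleotides = cds_length(location)
codons, remainder = divmod(nucleotides, 3)
print(nucleotides, codons, remainder)  # 1641 547 0
```

The 547 codons correspond to 546 residues plus one stop codon, which agrees with the stated Length of 546aa.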

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyltransferase 8 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R05191
KEGG_rclass RC00005
BRITE ko00000, ko00001, ko01000, ko01003
KEGG_ko ko:K13648
EC 2.4.1.43
KEGG_Pathway ko00520, map00520
GOs GO:0003674, GO:0003824, GO:0008194, GO:0016740, GO:0016757, GO:0016758, GO:0047262

Sequence

CDS:  
ATGGTTCCCGATTTTGTCCATGGAACTTGGATAGGAAGAGGCGATAAAAATTTGGATGATACACCCGAGAAGTTGTATCAAAGGAACCTGAGACAAGAAAGACGAGAGAAGCGGGCCAATGAGCTGTTGCAAAAAGACGAAACTATTCAAGAACTTGAGGAAGCAGCCATTGTGCGGTCCAAATCTGTGGATTCTGCAGTGATAGGGAATTACACGATTTGGAAAAGAGAATACGGGAAACAGAACAACATCGAAGAAATAATAGGATTGATGCAAGATCAGAACATCATGACTAGAGTTTACGCTAGCATTGCAAAGATGAAAAATAAGCTTGTCTTGCGTGAAGAACTAGAAACACAACTATCGAAAAGCCTAGAAGCTCTGGAGGAAGCATCCATTGGCGTTGGTCTGCCGGAGAGAATCCCTGATATGATTAGAGCGATGGGCCAAGTTCTGTCTCGAGCAAACGAGCAACTATATGAGTGCAAGTTGGTCACAGGTAAACTGAGAGCAATGCTTCATACTGCAGATGAAGAACTGGCTGGCGTGAAAACATACGCGACTTTCTTGTCTCAGCTGGCGGCCAAAACACTGCCAGATGTTATTCACTGCTTGTCCATGCGCCTGAATCTAGAGTACTATCTCCTTCCACCGGAGATGAGAAAATTCCCTCGGAAACAGAACTTGGAGGACCCAGATCTTCACCACTACGCTATCTTCTCAGATAATGTAGTGGCAACATCAGTCGTCGTTAACTCCACCATCATGAACACCCAGGACACTTCAAGGCATGTCTTTCACCTGGTGACCGACGAACTCAATTTTGGAGCAATGAGTATGTGGTTTATACTGAATCCTCCAGGAAAGGCGACCATCGAGGTTCAAAGTGTGGATAACTTTACGTGGCTTAATTCATCATACTGTCCTGTTCTGAGACAGATTGAGTCAGCAGCGATGAAGGAGTTCTATTTCAAGACGGAGAGGTCAGAGTCTGTAGAATCGGGCACAGAGAGCCTAAAGTACAGGAACCCAAAGTACCTCTCAATGCTAAACCACTTGAGATTCTACCTCCCTGAGATTTTCCCAAAGCTGGAGAAAATCCTGTTTCTGGACGATGACGTGGTTGTTCAGAGGGATCTAAGTGCCTTGTGGTCAGTGGACCTCACGGGGAAAGTGAATGGAGCAGTAGAAACTTGTGGGGCGAGCTTTCATCGCTTTGACACGTATCTCAACTTCACCGATCCTCGCATTTCCAGCAACTTTGACCCGCAAGCATGTGGATGGGCGTACGGCATGAACATTTTCGACCTGAAAGAGTGGAAGAAGAATAACATAACAGACACCTACCACTACTGGCAAAACCTGAATGGGGAGAGGAGGCTGTGGAAGCTGGGGTCGCTGCCCCCCGGGCTGATAACGTTCTACAATCTGACGAAAGCGATAGAGAAAAAGTGGCACCTTCTTGGCTTGGGATATGACAAAGACATTGATCTGAAGGAGGTGGAGAACTCGGCAGTTATACACTACAATGGACACTTGAAGCCATGGACGGAGTTGGCAATACCCAAGTATCTCTCCTACTGGGCCAACTACTTCCCTTTTCACCACCCTTACCTCCGCGCCTGCGCCCTTTCCCTTTGA
Protein:  
MVPDFVHGTWIGRGDKNLDDTPEKLYQRNLRQERREKRANELLQKDETIQELEEAAIVRSKSVDSAVIGNYTIWKREYGKQNNIEEIIGLMQDQNIMTRVYASIAKMKNKLVLREELETQLSKSLEALEEASIGVGLPERIPDMIRAMGQVLSRANEQLYECKLVTGKLRAMLHTADEELAGVKTYATFLSQLAAKTLPDVIHCLSMRLNLEYYLLPPEMRKFPRKQNLEDPDLHHYAIFSDNVVATSVVVNSTIMNTQDTSRHVFHLVTDELNFGAMSMWFILNPPGKATIEVQSVDNFTWLNSSYCPVLRQIESAAMKEFYFKTERSESVESGTESLKYRNPKYLSMLNHLRFYLPEIFPKLEKILFLDDDVVVQRDLSALWSVDLTGKVNGAVETCGASFHRFDTYLNFTDPRISSNFDPQACGWAYGMNIFDLKEWKKNNITDTYHYWQNLNGERRLWKLGSLPPGLITFYNLTKAIEKKWHLLGLGYDKDIDLKEVENSAVIHYNGHLKPWTELAIPKYLSYWANYFPFHHPYLRACALSL
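
The Protein entry is the conceptual translation of the CDS entry. The record does not state which translation table was used; the sketch below assumes the standard genetic code (NCBI table 1), the usual choice for a plant nuclear gene. The names CODON_TABLE and translate are illustrative, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[pos:pos + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence in this record.
cds_prefix = "ATGGTTCCCGATTTTGTCCATGGAACTTGG"
print(translate(cds_prefix))  # MVPDFVHGTW
```

The output matches the first ten residues of the Protein sequence. Passing the full CDS string to translate would be the way to check the remaining residues.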